A High-level Morphological Description Language Exploiting Inflectional Paradigms
نویسندگان
چکیده
A high-level language lor the description of inflectional morphology is presented, in which the organization of word lormation rules into an ii~herilance hierarchy of paradigms allows l o r a natural encoding of the kinds of nfles typically pre~uted in grammar txroks. We show how tim language, composed of orthographic rides, word formation rules, and paradigm inheritance, can be compiled into a run-time data structure for efficient morphological analysis and generation with a dynamic secondary storage lexicon.
منابع مشابه
A Paradigm-Based Finite State Morphological Analyzer for Marathi
A morphological analyzer forms the foundation for many NLP applications of Indian Languages. In this paper, we propose and evaluate the morphological analyzer for Marathi, an inflectional language. The morphological analyzer exploits the efficiency and flexibility offered by finite state machines in modeling the morphotactics while using the well devised system of paradigms to handle the stem a...
متن کاملLanguage Learning ISSN 0023-8333 Introduction. Beyond the Obvious: Do Second Language Learners Process Inflectional Morphology?
Given that this special issue is devoted to the acquisition and processing of inflectional morphology by second language (L2) learners, the question in the title may appear redundant. However, recent research on first language (L1) and L2 morphological processing has challenged basic assumptions about the status of inflectional morphology in linguistic processing that had long been taken for gr...
متن کاملAssigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction
This paper describes how a supervised learning method is used for assigning inflectional paradigms to organizational named entities as the main prerequisite for generating a morphological lexicon of these entities. An inflectional paradigm consists of a set of rules for generating all forms of a lexicon entry. A morphological lexicon consists of lexicon entries and their corresponding forms. Th...
متن کاملA Non-parametric Model for the Discovery of Inflectional Paradigms from Plain Text Using Graphical Models over Strings
The field of statistical natural language processing has been turning toward morphologically rich languages. These languages have vocabularies that are often orders of magnitude larger than that of English, since words may be inflected in various different ways. This leads to problems with data sparseness and calls for models that can deal with this abundance of related words—models that can le...
متن کاملDiscovering Morphological Paradigms from Plain Text Using a Dirichlet Process Mixture Model
We present an inference algorithm that organizes observed words (tokens) into structured inflectional paradigms (types). It also naturally predicts the spelling of unobserved forms that are missing from these paradigms, and discovers inflectional principles (grammar) that generalize to wholly unobserved words. Our Bayesian generative model of the data explicitly represents tokens, types, inflec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1992